The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding

نویسندگان

  • Dian Yu
  • Hongzhao Huang
  • Taylor Cassidy
  • Heng Ji
  • Chi Wang
  • Shi Zhi
  • Jiawei Han
  • Clare R. Voss
  • Malik Magdon-Ismail
چکیده

Information Extraction using multiple information sources and systems is beneficial due to multisource/system consolidation and challenging due to the resulting inconsistency and redundancy. We integrate IE and truth-finding research and present a novel unsupervised multi-dimensional truth finding framework which incorporates signals from multiple sources, multiple systems and multiple pieces of evidence by knowledge graph construction through multi-layer deep linguistic analysis. Experiments on the case study of Slot Filling Validation demonstrate that our approach can find truths accurately (9.4% higher F-score than supervised methods) and efficiently (finding 90% truths with only one half the cost of a baseline without credibility estimation).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RPI-BLENDER TAC-KBP2013 Knowledge Base Population System

This year the RPI-BLENDER team participated in the following four tasks: English Entity Linking, Regular Slot Filling, Temporal Slot Filling and Slot Filling Validation. The major improvement was made for Regular Slot Filling and Slot Filling validation. We developed a fresh system for both tasks. Our approach embraces detailed linguistic analysis and knowledge discovery, and advanced knowledge...

متن کامل

Unsupervised Person Slot Filling based on Graph Mining

Slot filling aims to extract the values (slot fillers) of specific attributes (slots types) for a given entity (query) from a largescale corpus. Slot filling remains very challenging over the past seven years. We propose a simple yet effective unsupervised approach to extract slot fillers based on the following two observations: (1) a trigger is usually a salient node relative to the query and ...

متن کامل

High-Dimensional Unsupervised Active Learning Method

In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...

متن کامل

Stacked Ensembles of Information Extractors for Knowledge-Base Population by Combining Supervised and Unsupervised Approaches

The UTAustin team participated in two main tasks this year the Cold Start Slot Filling (CSSF) task and the Slot-Filler Validation/Ensembling task, which was divided into the filtering and ensembling subtasks. Our system uses stacking to ensemble multiple systems for the KBP slot filling task, as described in our ACL 2015 paper. We expand the stacking approach by allowing the classifier to also ...

متن کامل

Using a weakly supervised approach and lexical patterns for the KBP slot filling task

We present in this article the system we developed for participating to the slot filling task in the Knowledge Base Population (KBP) track of the 2011 Text Analysis Conference (TAC). This system is based on a weakly supervised approach and lexical patterns. In this participation, we tested more specifically the integration of an additional unsupervised relation identification component dedicate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014